README

This data set was established in the second round of the SANTA shared task. For details about the process, please checkout the corresponding special issue, published in the journal Cultural Analytics.

Annotation Process 

The annotation has been done with the annotation tool CorefAnnotator, available under the DOI 10.5281/zenodo.1228105. Please note that the annotation was done with version 1.14.3 of the tool.

The annotation was done in parallel by two annotators, who are named A1 and A2 in the files provided here. A1 for one guideline, however, is not the same person as A1 for another guideline.

File Formats

xmi: This package provides two file formats. The original file format that can directly be opened with the annotation tool is located in the folder 'xmi'.

csv: An export of the annotations, such that they can be used fed into the Gamma tool for calculating inter-annotator agreement (10.1162/COLI_a_00227). Please note that these CSV files are different from the ones that can be exported from CorefAnnotator.

txt: The plain text files that were the basis of the annotation.

For questions, please contact nils.reiter@uni-koeln.de.